A FAX Reader for the Blind
Authors
Abstract
We describe an experiment in which a blind person places arbitrary printed pages in a FAX machine and soon afterwards hears the contents read out loud over the telephone by a synthesized voice. This study focuses on technical issues including the accuracy achievable by state-of-the-art optical character recognition operating on FAX images, methods to improve the intelligibility of synthesized speech for this application, and ease of interaction with the user. In a small-scale trial of the service under laboratory conditions, we have observed that accuracy and intelligibility on some commonly occurring types of documents, including typewritten letters, are usefully high. We believe that technology developments in the near future will support services acceptable to a wide range of users.

1. Introduction

We describe a small-scale exploratory study of a new telecommunications service concept directed at visually-impaired populations: reading hardcopy printed text out loud over the telephone, fully automatically. This requires integrating state-of-the-art technologies for voice, data, and image processing. The service accepts images of documents using facsimile transmission (FAX), translates the image to ASCII text using optical character recognition (OCR), and converts the text to synthesized speech (TTS). Due to the ubiquity of TouchTone telephones and FAX machines, such a service could enable visually-impaired people to cope with paper copies of documents and letters in a convenient and inexpensive manner, both at work and at home. We have carried out a small-scale feasibility trial under laboratory conditions, focusing on technical issues including the accuracy achievable by state-of-the-art OCR operating on FAX images, methods to improve the intelligibility of TTS in this application, and ease of interaction with the user.
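This text does not say how OCR accuracy on FAX images was scored. As an illustration only, the sketch below shows one common way to quantify it: character-level accuracy derived from the edit distance between a reference transcript and the OCR output. The function names `edit_distance` and `character_accuracy` and the sample strings are hypothetical and are not taken from the study.

```python
# Illustrative sketch only: the scoring method is an assumption,
# not the procedure used in the study.

def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning ref into hyp."""
    prev = list(range(len(hyp) + 1))  # distances from the empty prefix of ref
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting the first i characters of ref
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_accuracy(ref: str, hyp: str) -> float:
    """Fraction of reference characters recovered, clamped to [0, 1]."""
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))


if __name__ == "__main__":
    reference = "Dear Sir, thank you for your letter."   # hypothetical ground truth
    ocr_output = "Dear Sir, thank yon for your 1etter."  # hypothetical OCR result
    print(f"character accuracy: {character_accuracy(reference, ocr_output):.3f}")
```

A character-level score of this kind is only a proxy for the intelligibility of the spoken output, since a single misrecognized character can make a whole word unintelligible when synthesized.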
The results of the study show that the service concept is both technically feasible and useful to the blind using currently-available technology. As a guide to future work, we propose a set of enhancements, principally in human-factors design and OCR accuracy, many of which should be straightforward to implement, and a few of which may require further research.

24th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, California, November 5-6, 1990. 1990 Maple Press
Similar resources
Evaluation of Ferrous-Agarose-Xylenol Gel Properties in Radiation Dosimetry
Background: Over recent decades, modern protocols of external beam radiotherapy and radiation techniques such as intensity-modulated radiotherapy (IMRT) have been developed. These methods are extremely sensitive to errors in treatment delivery, so that it is essential to apply a high resolution 3D dosimetry system that has high sensitivity and is capable of measuring and verifying the complex...
From ShopTalk to ShopMobile: Vision-Based Barcode Scanning with Mobile Phones for Independent Blind Grocery Shopping
Independent grocery shopping is a major challenge for many visually impaired (VI) individuals [1]. In 2006, we began our work on ShopTalk, a wearable system for independent blind supermarket shopping [2]. ShopTalk consisted of a small OQO computer, a wireless barcode reader, and ...
The Divining Reader: A Construct Based on the Bibliomantic Approach to Hafez’s Divan
Hafez Shirazi was a distinguished Persian poet. His poetry collection, Divan, is regarded as a literary work of profound significance. Iranians view this collection as something much more than poetry because it is also used for bibliomantic purposes. After studying Hafez in his social context and exploring distinctive qualities of his Divan, particularly its application as a divination tool, th...
What Frustrates Screen Reader Users on the Web: A Study of 100 Blind Users
In previous research, the computer frustrations of student and workplace users have been documented. However, the challenges faced by blind users on the Web have not been previously examined. In this study, 100 blind users, using time diaries, recorded their frustrations using the Web. The top causes of frustration reported were (a) page layout causing confusing screen reader feedback; (b) conf...
A Novel Blind Watermarking of ECG Signals on Medical Images Using EZW Algorithm
Introduction: In this study, ECG signals have been embedded into medical images to create a novel blind watermarking method. The embedding is done when the original image is compressed using the EZW algorithm. The extraction process is performed at the decompression time of the watermarked image. Materials and Methods: The multi-resolution watermarking with a secret key algorithm developed in th...
How Blind Users' Mental Models Affect Their Perceived Usability of an Unfamiliar Screen Reader
This study investigates blind users’ mental models of the Windows environment and their strategies in coping with new desktops and applications. The relationship between users’ mental model and their perceived usability problems when using an unfamiliar screen reader is also reported. Blind users in this study possess a functional or structural mental model or a combination thereof. They also ha...